首页> 外文OA文献 >Performance optimization for managing massive numbers of small files in distributed file systems
【2h】

Performance optimization for managing massive numbers of small files in distributed file systems

机译:性能优化,用于管理分布式文件系统中的大量小文件

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The processing of massive numbers of small files is a challenge in the design of distributed file systems. Currently, the combined-block-storage approach is prevalent. However, the approach employs the traditional file systems such as ExtFS and may cause inefficiency when accessing small files randomly located in the disk. This paper focuses on optimizing the performance of data servers in accessing massive numbers of small files. We present a Flat Lightweight File System (iFlatLFS) to manage small files, which is based on a simple metadata scheme and a flat storage architecture. iFlatLFS is designed to substitute the traditional file system on data servers and can be deployed underneath distributed file systems that store massive numbers of small files. iFlatLFS can greatly simplify the original data access procedure. The new metadata proposed in this paper occupies only a fraction of the metadata size based on traditional file systems. We have implemented iFlatLFS in CentOS 5.5 and integrated it into an open source Distributed File System (DFS), called Taobao FileSystem (TFS), which is developed by a top B2C service provider, Alibaba, in China and is managing over 28.6 billion small photos. We have conducted extensive experiments to verify the performance of iFlatLFS. The results show that when the file size ranges from 1KB to 64KB, iFlatLFS is faster than Ext4 by 48% and 54% on average for random read and write in the DFS environment, respectively. Moreover, after iFlatLFS is integrated into TFS, iFlatLFS-based TFS is faster than the existing Ext4-based TFS by 45% and 49% on average for random read access and hybrid access (the mix of read and write accesses), respectively.
机译:在分布式文件系统的设计中,处理大量小文件是一个挑战。当前,组合块存储方法是普遍的。但是,该方法采用了传统的文件系统,例如ExtFS,并且在访问随机位于磁盘中的小文件时可能会导致效率低下。本文着重于在访问大量小文件时优化数据服务器的性能。我们提出了一种扁平轻量级文件系统(iFlatLFS),用于管理小文件,该系统基于简单的元数据方案和扁平存储体系结构。 iFlatLFS旨在替代数据服务器上的传统文件系统,并且可以部署在存储大量小文件的分布式文件系统下。 iFlatLFS可以大大简化原始数据访问过程。本文提出的新元数据仅占传统文件系统元数据大小的一小部分。我们已经在CentOS 5.5中实现了iFlatLFS,并将其集成到名为“淘宝文件系统”的开源分布式文件系统(DFS)中,该文件系统由中国领先的B2C服务提供商阿里巴巴开发,目前管理着286亿张小照片。我们进行了广泛的实验,以验证iFlatLFS的性能。结果表明,当文件大小在1KB到64KB之间时,对于DFS环境中的随机读取和写入,iFlatLFS的速度分别比Ext4快48%和54%。此外,在将iFlatLFS集成到TFS中之后,基于iFlatLFS的TFS相对于现有的基于Ext4的TFS分别在随机读取访问和混合访问(读取和写入访问的混合)方面分别快45%和49%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号